Finding The Sign Of A Function Value By Binary Cellular Automaton 



H. F. Chau*, Hao Xu, K. M. Lee, L. W. Siu and K. K. Yan 

Department of Physics, University of Hong Kong, Pokfulam Road, Hong Kong 

(February 8, 2008) 

Given a continuous function f(x), suppose that the sign of / only has finitely many discontinuous 
points in the interval [0, 1]. We show how to use a sequence of one dimensional deterministic binary 
cellular automata to determine the sign of f(p) where p is the (number) density of Is in an arbitrarily 
given bit string of finite length provided that / satisfies certain technical conditions. 
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I. INTRODUCTION 

Cellular Automaton (CA) is a simple local interac- 
tion model of natural systems used extensively in various 
fields of physics |fl]. Besides, CA can be regarded as a 
computation model motivated by biological phenomenon. 
Various inequivalent definitions of CA exist in the com- 
munity. In this paper, we restrict ourselves to consider 
the time evolution of CA to be governed by a local syn- 
chronous deterministic uniform rule. In other words, the 
state of each site in the next time step depends determin- 
istically only on the states of its finite neighborhood, the 
states of all sites are updated in parallel, and the trans- 
formation CA rule table is covariant under translation 
of the background lattice. In this respect, CA can be 
regarded as a decentralized deterministic parallel com- 
putation model without central memory storage. It is 
useful to study its power and limitation. In fact, a re- 
cent renewal interest in CA computing makes such an 
investigation timely. 

Since CA rules are local in nature, it is instructive to 
see if CA can perform tasks that involve global quan- 
tities. Perhaps the most well-known example is the so- 
called density classification problem. Considering a bit 
string of finite length N with periodic boundary condi- 
tions, the problem is to change all bits in the string to 
if the number of 0s is greater than the number of Is 
in the input bit string, and to change all bits to 1 if the 
number of Is is greater than the number of 0s. Clearly, 
the global quantity involved in this problem is the density 
of Is which is defined as the number of Is in the string 
divided by the string length N. 

Various CA rules have been proposed to solve the den- 
sity classification problem both by human ||] and by ge- 
netic algorithm Q|. But they only provide approximate 
solutions. In other words, these rules work for most but 
not all randomly chosen initial configurations. In fact, 
Land and Belew showed that density classification can- 
not be performed perfectly using a single one dimensional 
CA rule @). Later on, Capcarrere and his collaborators 



proved that a single CA rule can solve the density clas- 
sification and other related problems exactly if we mod- 
ify either the required output of the automaton or the 
boundary conditions |^,^| . However, in their approach, 
it has to scan through the states of all sites in the fi- 
nal configuration, in general, before knowing the answer. 
This requires global memories in the read out process. 
In contrast, read out in the original density classification 
problem can clearly be done by looking at the states of a 
few local sites, and hence can be done with an additional 
finite rule table whose size is independent of N. 

Recently, Fuks pointed out that the density classifica- 
tion problem can be solved exactly if we apply two CA 
rules in succession || . More precisely, he showed that by 
applying a CA rule a fixed number of times depending 
only on the lattice size and then followed by applying an- 
other CA rule a fixed number of times depending again 
only on the lattice size, the density classification prob- 
lem can be solved exactly. Fuks further asked if it is 
possible to classify a rational density. That is, he ques- 
tioned if it is possible to determine, using succession of 
CA rules, whether the density of Is in an arbitrary one 
dimensional array with periodic boundary conditions is 
less than, equal to or greater than a prescribed ratio- 
nal number p/q, called the critical density. Chau et al. 
answered his question affirmatively by showing that two 
CA rules are necessary and sufficient in solving the ratio- 
nal density classification problem exactly ||. His group 
further generalized their algorithm to find the majority 
state of an one dimensional array when each site may 
take on n possible states jio]]. In addition, further con- 
straints on the forms of certain C A rules that can be used 
in the rational density classification problem have also be 
reported JlTj ] . 

In Section [Dj we shall spell out the details of the prob- 
lem that we are interested in. Then after introducing 
some technical results in Section III, we shall solve the 



problem using a set of binary CAs with local synchronous 
deterministic uniform rules in Section IV. The general 



solution reported in Section IV can be simplified in a 
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number of special but important cases. We report the 
efficient solutions to these special cases in Section [v|. Fi- 
nally, we give a brief discussions on the significance of 
our result in Section Jv|. 



II. STATEMENT OF THE PROBLEM 

In this paper, we propose another generalization of the 
density classification problem, called the generalized den- 
sity classification problem (GDCP) and solve it using a 
sequence of CA rules. Before we discuss the significance 
of our generalization, let us first state our problem pre- 
cisely. 

Suppose we are given an input bit string a of length N 
in periodic boundary conditions, as well as a continuous 
function f(x) with finitely many discontinuous points of 
sgn(/(x)) in the interval [0, 1], where 



In addition to naturally generalize the density classi- 
fication problem, GDCP is interesting on its own right. 
First, by choosing 



sgn(y) = 



I if y >0, 
-1 if w < 0, 
ify = 0. 



(1) 



Moreover, those discontinuous points are rational and 
shall be denoted by Pi/q where pi,q G Z + U {0}, and 
i is the index for the discontinuous points. In addition, 
we have the freedom to choose the smallest positive q for 
the denominators of all the expressions Pi/q. With this 
choice, pi and q need not be relatively prime. Finally, we 
require that /(^) and /( g~ ) arc non-zero. 

We define the density of Is of the bit string a as the 
total number of Is in the bit string divided by the string 
length N and is denoted by the symbol p(a). We may 
simply write p instead of p(a) when it is clear from the 
content which bit string we are referring to. 

The task of GDCP is to evolve the bit string a to all 
1 (or 0) if sgn(/(p(a))) = 1 (or sgn(/(p(a))) = -1). Be- 
sides, we have to preserve the density of Is of the bit 
string if sgn(/(p(a))) = 0. In addition, by inspecting a 
fixed portion of the final output string, we are able to 
distinguish between the above three cases with certainty. 
(We shall consider a slightly relaxed variant to this def- 
inition later on in Section |v|.) To solve the GDCP by a 
sequence of binary CA rules, we mean that every initial 
configuration can be correctly classified by applying some 
binary CA rule Ri ti times followed by another binary 
CA rule R2 ti times and so on up to the /cth binary CA 
rule Rk tk times, where k is fixed and independent of 
the string length N. Besides, each CA rule Ri must be 
independent of the input a and has a bounded rule table 
size that is also independent of N. Finally, the number 
of iterations ti is independent of the initial configuration, 
but may depend on N. 

Clearly, the GDCP reduced to the rational density 
classification problem by choosing f(x) = x — p c where 
p c is a rational number. Moreover, the GDCP can be 
solved trivially if we have a global counter. 



x - pd if x < pd, 
f(x) = \ if pd < x < p c2 , 
x - p C 2 ifx> p c2 , 



(2) 



where p c \ < p C 2 both being rational, it determines if the 
density of Is is in the interval (p c i,p C 2) from a single 
copy of input bit string. (Note that it is more efficient to 
ask if p is between 1/10 and 4/39 rather than to ask i f p 
equals 0.10215 partly because of our result in Section IV 
that the former question requires smaller CA rule tables.) 
Thus, GDCP can be used to determine the coarse-grained 
density of Is efficiently, while in the rational density clas- 
sification problem, the resolution is infinite in the sense 
that we can only determine whether p > p c , p — p c or 
p < p Cl no matter how closed p is to p c . This reason 
alone is good enough to investigate the GDCP. Second, 
the GDCP has a more interesting flow diagram. While 
the rational density classification automata has two sta- 
ble and one unstable fixed points in the density of Is, the 
GDCP automata in general has at most two stable fixed 
points (p = 1 and p = 0), finitely many unstable fixed 
points (at where f(p) =0 but / is not identical zero lo- 
cally near p) and infinitely many neutral fixed points (at 
where f(p) = and / is identical zero locally near p) in 
the limit of N — > 00. 



III. SOME USEFUL CELLULAR AUTOMATON 
BUILDING BLOCKS 

Let us first report several essential building blocks of 
our solution to the GDCP. 



A. Car Hopping Automaton 

The first step in solving the problem is to make the 
density of Is uniform so as to make the local density of 
Is a good estimation of the global density of Is in the bit 
string. To measure the local density, we introduce the 
concept of /cth order local number. It is just the number 
of Is around the site we are interested. The CA rule 
will move the Is to the right according to the local den- 
sity gradient. If we repeatedly apply Hj, to an arbitrary 
bit string, eventually the density of Is of the resulting 
string will be more or less uniform. Of course we can- 
not expect that all local numbers are equal, but we can 
prove that the difference between local numbers of differ- 
ent sites can only be less than or equal to one. This CA 
rule is the most important CA rule in this paper. 

Definition 1 We denote a configuration in our bit string 
of length N, which can be identified with an one dimen- 
sional array of sites in periodic boundary conditions, by 
a and the state of the ith site by a(i). That is, a(i) is 
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the value of the ith bit in the string counting from left to 
right. We define the kth order local number of the ith site 
in our one dimensional array by n k (a, i) — Ylj=o 
where the sum i + j in the index here and elsewhere in 
this paper is understood to be modulo N . 

In other words, the kth order local number is the num- 
ber of sites with state 1 among the k consecutive sites 
starting from the site i. So, if we regard state as an 
empty site and state 1 as a site with a car, the kth order 
local number gives the local number of cars near a site. 
From now on, we shall interchangeably describe the state 
I as a car and the state as an empty site. Thus, the 
density of Is of a bit string can also be referred to as the 
car density. 

In what follows, we shall write n k (i) instead ofn k (ct,i) 
when there is no confusion on which bit string we are re- 
ferring to. 

Rule 1 (fcth order car hopping rule) Let us denote 
this CA rule by Hfe . The state of site i in the next time 
step H k [a](i) is given by: 

{0 if a(i) = 1, a(i + 1) = and 
n k (i - k+ 1) > n k (i + 1), 
1 ifa(i) = 0,a(i - 1) = 1 and (3) 
n k (i - k) > n k (i), 
a(i) otherwise. 

This CA rule can be rewritten in another form as follows: 
we interchange the states in site i and i + 1 if and only 
if a(i) — 1, a(i + 1) = and n k (i — k + 1) > n k (i + 1). 
All other states remain unchanged. All updates are taken 
in parallel. In what follows, we shall sometimes refer to 
such an interchange as hopping of a car. 

For example, H 4 (011001010010) = 010100101010. 

Note that the role of and 1 in the above CA rule is 
symmetric in the sense that it is invariant under the in- 
terchange of state and 1 plus the relabeling of i as — i. 
Following the spirit of particle physics, we shall refer to 
this property as the CP symmetry. Furthermore, the car 
density p is conserved under this CA rule. In fact one 
way to interpret this CA rule is that cars are driven to 
move from left to right by local car density gradient. Or 
equivalently, empty sites are driven to move from right to 
left by local empty site density gradient. Therefore, we 
expect that the fcth order local numbers in all recurrent 
states under the repeated iteration of Hfe is evenly dis- 
tributed. Nonetheless, a rigorous proof turns out to be 
rather involved partly because a car at site i cannot hop 
whenever another car is occupying site i + 1 even in the 
presence of a car density gradient. We now begin rigorous 
proof by first introducing a few technical lemmas. 

We expect that for any bit string a, the density of 
[3 = H l k (a) will be uniform if I is large enough. The first 
lemma shows that this is indeed the case when (3 is a 
fixed point of 



Lemma 1 The kth order local number n k (i) is equal for 
all i if and only ifH k (a) — a (that is, no car hops under 
the action of the kth order car hopping rule). 

Proof: If Hfe (a) a, then some car in the bit string a 
hops under the action of Hfe . So clearly n k (i) > n k (i + fc) 
for some i. 

Conversely, if Hfe (a) = a, then we may assume that a 
does not equal to all or all 1, for otherwise the Lemma 
is trivially true. Now locally we have any one of the fol- 
lowing three situations. 

Case (a) a(i — 1) = 1 and a(i) = 0: in this case we 
clearly require n k (i — fc) < n k (i) in order to prevent the 
car located at site i — 1 from hopping. 

Case (b) a(i) = ■ ■ ■ = a(i+j — 1) = 1 and a(i + j) = 
for some j > 0: in this case we again conclude that 
n k (i — k) < n k {i) for otherwise we have n k (i + j — fc) > 
n k {i+ j — k — !)>■■■> n k (i — k) > n k (i) > n k (i + 1) > 
• • • > n k(i+j) contradicting the assumption that the car 
located at site i + j — 1 docs not move. 

Case (c) a(i) — a(i — 1) = • • • = a(i — j) = and 
a(i—j — 1) = 1 for some j > 0: the CP symmetry of the 
CA rule Hfe implies that we can use the same trick as in 
the proof of case (b) to show that n k (i — k) < n k (i). 

In summary, we conclude that n k (i — fc) < n k {i) for 
all i; and since we are working in a finite bit string with 
periodic boundary, this is possible only when n k (i — k) = 
n k (i) for all i. 

Suppose that n k (i — fc + 1) > n k (i — k + 2) for some i, 
then a(i — fc + 1) = 1 and a(i + 1) = 0. However, this also 
implies that n k (i + 1) > n k (i + 2) and hence a(i + 1) = 1 
and a(i + k+l) = which is a contradiction. By the same 
token, it is absurd to have n k (i — k + 1) < n k (i — k + 2). 
Thus, the only possibility is that n k (i) = n k (i + 1) for all 
i and hence n k (i) — n k (j) for all i, j. □ 

Lemma [j] gives an important property of fixed points 
of Hfe. To characterize the property of cycles of with 
period greater than one, we need to work harder. In what 
follows, we show that whenever there is a fluctuation in 
the fcth order local number n k {i) in a bit string string a, 
at least one car in this string must hop under the action 
of the fcth order car hopping rule Hfe. 

Lemma 2 If n k {a,i — k + 1) > n k (a,i + 1), then there 
exists j with i — fc + 1 < j < i + k with n k (a, j — k + 1) > 
n k (a,i — k + 1) such that there is a car located at site 
j and this car hops by applying the rule Hfe. Moreover, 
a car will hop from site i to i + 1 in no more than k 
time steps. That is to say, there exists r < k such that 
Hfc _1 [a](i) = 1, HX _1 [a](i + 1) = and H£[a](i) = 0. 

Proof: We divide the proof into the following three cases. 

Case (a) a(i) = 1 and a(i + 1) = 0: we choose j = i as 
the car located at site i hops by applying Hfe. 

Case (b) a(i + l) = 1: since n k (i — fc+1) > n k (i + l), we 
can always find 1 < m < fc such that a(i + l) = a(i + 2) = 
■ ■ ■ = a(i + m) — 1 and a(i + m + 1) = 0. Then it is 
easy to see that n k (i + m — k + 1) > n k (i — k + 1) > 
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rijfe(i + 1) > n k (i + m + l). So the car located at site i + m 
hops to i + m + 1 when we apply Hence, we choose 
j = i + m. Inductively, in the next time step, we may 
choose j = i + m — 1 and so on. Consequently, in no more 
than fc time steps, a car must hop from site i to i + 1. 

Case (c) a(i) = a(i + 1) — 0: exploiting the CP sym- 
metry of the fcth order car hopping rule, this case can be 
proved in the same way as in case (b). □ 

We now introduce the fluctuation amplitude, which is 
the difference of the local numbers in different sites. It 
will be shown that the maximum fluctuation amplitude 
will not increase by the repeated application of 

Definition 2 Let a be a bit string of length N. We de- 
note Ak(a) — max{nfc(a,i) — n k (a,j) : 1 < i,j < N} 
the maximum fluctuation amplitude of the kth order lo- 
cal number. We write the maximum and minimum 
kth order local numbers of a bit string by n k ™ (a) = 

max{n/c(a,i) : 1 < i < N} and nj™ (ot) = 
mining (a, i) : 1 < i < N}, respectively. In addition, we 
denote the number of sites with kth order local number 
equals m by M k {a,m) — |{1 < i < N : n k {a,i) — m}\. 

For simplicity, we shall drop the label a in A k , n k m , 

n i mm ^ an d Mk when it is clear which bit string we are 
referring to in the text. 

Now, we are ready to show that repeated application 
of Hfc decreases the fluctuation amplitude of the fcth or- 
der local number to its minimum possible value in O(N) 
time steps. More precisely, we are going to prove the 
following lemma: 

Lemma 3 Let a be a bit string of length N. Then 

A fe (H" fc (fc) (a)) < 1 where i = £(k) = 2k(k - 2) \\ [£] ] . 

Besides, A k {R k ^ (a)) = if and only if p(a) = r/k for 
some r G Z. 

Proof: First of all, we show that if A k ([3) < 1, then 
A k ((3) = if and only if p{(3) = r/k for some r E Z. 
Since the fluctuation of n k {(3,i) is less than 2, n k ([3,i) 
can either be \kp{(3)\ or \kp([3)~\ . So, if p([3) — r/k for 
some r G Z, then A k ((3) = 0. Conversely, if p{(3) is not 
in the form r/k for some r E Z, then there exist i and 
j such that n k ([3,i) = \kp{(3)\ and n k (f3,j) = \kp([3)~\. 
Hence, A k ((3) = l. 

Since H k conserves car density, the last statement of 
this Lemma is proved by setting (3 = H e k (a). 

To prove the first part of this Lemma, we may fur- 
ther assume that (a) H k (a) H k ~ 1 (a) for all i < I and 
(b) A k (a) > 1. For if (a) does not hold, then Hf(a) 
is a fixed point of H k and hence Lemma [l] tells us that 
Afc(H^(a)) = 0. In addition, Eq. ||) tells us that it is not 
possible to increase the fcth order local number n k ((3,i) 
of a site i in a bit string /3 if n k ((3,i) equals n k max \f3). 
Hence, n k m&K \H k (a)) is a non-increasing function of i; 



and by CP symmetry of the fcth order car hopping rule, 
n[, mm ^ (H k (a)) is a non-decreasing function of i. As a 
result, the first part of this Lemma is trivially true if 
A fc (a) < I. 

Eq. (0) also tells us that for any bit string /3, 
n k {R k {p),i) — n k ((3,i) + 1 if and only if a car in /? 
hops from site i — 1 to i while no car in (3 hops from 
i + k ~ ltoi + fc under the action of H^. Besides, 
rifc(Hfc(/3), i) — n k ((3, i) — 1 if and only if a car in (3 hops 
from site i + k — 1 toi + fc while no car in (3 hops from 
i — 1 to i under H k . Finally, n k {R k {(3), i) — n k ([3,i) 
if and only if cither no car in (3 hops from sites i — 1 
and i + k — 1 or cars in (3 hop from both sites i — 1 and 
i + k — 1. We deduce from this observation together with 
Lemma Jj that at least one site with fcth order local num- 
ber n k m hops under H^. Consequently, the number of 
cars with maximum value of fcth order local number does 
not increase with time. In other words, 

M k (H k (f3), 4 max) 03)) < M k (f3, 4 max) 09)) (4) 

and the equality holds if and only if the number of 

sites i with n k ((3,i) = n k (/?) and n k (H k (f3),i) = 

n i maX ^ (P) ~ 1 equals the number of sites j with n k (/?, j) = 

4 max) 09) - 1 and n fc (H fc (/3),j) = n k m ™\[3). Eq. © de- 

mands that n k ((3, j — fc) = n k max \(3) and hence no car can 
hop to the site j — fc, so we also have n k (H. k ([3),j — fc) = 

n k max \[3) — I. Therefore, the strict inequality holds in 
Eq. (0) if there exits a site i such that n k (/3, i — k + 1) = 
4 m< ^W n k {[3,% + I) < 4 max) (/3) - 1, (3(i) = I and 
/3(* + l) = 0. 

Since we assume that A k (a) > 2, we claim that we can 
always find 1 < i < N and j > such that n k max ^ {(3) = 

rifcOM-fc+l) >4 max) (/3)-l =n k (P,i + l) = n k ((3,i + 
fc + 1) = ••• =n k ((3,i+jk + l) > n k (f3,i + (j + l)k + l). 
Otherwise, suppose that for some i", n k {(3, i" + mk + l) is 
equal to either n k max \f3) or n k max \f3) — I for all m G Z. 
Hence, the density p satisfies n k max \/3) — 1 < kp < 
n k naK \(3). By the assumption that A k (a) > 2, there 

must be some i' such that n k ([3, i' + 1) < n k max \(3) — 2. 
The conclusion on the density forces that for some m G Z, 
n k ((3,i' + mfc + I) = n k max \(3). That is the claim. 

Lemma || tells us that a car will hop from site i to i + 1 
and hence rtfc(i — fc + 1) is reduced in no more than fc 
time steps. That is, there exists r with 1 < r < k such 
that n k (W k ({3),i -fc + 1) < n k (f3, i — k + 1). Besides, 
n fc (Hj;O0),i + 1) = n k max) (f3) if j > 2. Inductively, in 
no more than a = fc [^-] time steps, number of sites 
with local number equals n k mayi \(3) must be reduced by 
at least one. In other words, M k (H%((3), n k mayi) (f3)) < 
M k (J3, n[ max ' ) ((3)). In the above discussion, it is clear that 

a packet of cars with fcth order local number n k maK \[3) 
initially located around site i is moving from left to 
right at a speed of at least 1 site per time step until it 
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hops into a region with fcth order local number less than 
n ™ (/?) — 1. Thus, this packet of cars may prevent an- 
other packet of fcth order local number n[ max ' ) cars in its 
left hand side from moving at most fc times. Inductively, 
if Ufe(/3, i — rk + 1) = n£ nax ^ (0) , then a car will begin to 
hop from site i — rk to i — rk + 1 in no more than rk 
time steps. After that, this packet of cars will move at 
a speed of at least 1 site per time step from left to right 
provided that of the configuration is still greater than 
1. By CP symmetry, a similar conclusion can be drawn 
for sites with fcth order local number equals to n!™ 1 . 

, maxl 

■n\ 

fmin) 



> 



l 



ft/2 + 1/2 
2. 



Since a site with fcth order local number n[. max ' , and an- 



other site with fcth order local number 



separate by 



at most TV — 1 sites, so the above discussions imply that in 
at most 2fc |~i |~-^] ] time steps, A& is reduced by at least 
one until A fc < 1. Hence, within I = 2fc(fc - 1) \\ \f] ] 
time steps, A^ < 2. □ 

Combining Lemmas |l| and ||, we know that the max- 
imum fluctuation amplitude in car density evolves to- 
wards the minimum possible value under the repeated 
action of We remark that relaxation time estimate 
I in the above Lemma is rather conservative. It is not 
difficult to reduce this estimation by a factor of two or 
more. Nevertheless, for the purpose of solving the GDCP, 
the present estimation which states that £ — O(kN) for 
fc < N is already enough. 

Our investigation so far can be summarized in the fol- 
lowing theorem. 

Theorem 1 Let a be a bit string of length N and I = 
i(k) = 2fc(fc-2) \\ \f ] ] . Then the bit string (3 = H|(a) 
has the following properties: 

(a) The kth order local number nk(/3,i) is equal to ei- 
ther \kp\ or [ftp] ; 

(b) Suppose that H fe (/3) ^ 0. If p(a) < 1/2, then 
(3 does not contain two consecutive Is as its substring. 
Similarly, if p(a) > 1/2, then [3 does not contain two 
consecutive Os as its substring. Thus, if p(a) < 1/2, a 
car at site i in the string (3 hops to the right under 

if and only if \kp~\ = rik(i — k + 1) > n^{i + 1) = [fcpj . 
Similarly, if p > 1/2, an empty site i in the string f3 hops 
to the left if and only if [ftp] — n^{i — k) > nk(i) = \_kp\ ■ 

Proof: We only need to prove (b) as (a) is already con- 
tained in the proof of Lemma y. 

To prove (b), we use part (a) of this Theorem. It tells 
us that nk(f3,i) equals either [fcpj or [ftp]. We denote 
the minimum distance between two successive cars in the 
bit string (3 by d, that is, d = min{? — j : [3{i) = 1, (3{j) = 
1 and rii-j(f3,i) — 1}. Since Afc = 1, Lemmas |l| and || 
imply that fcp ^ Z. Hence, the maximum distance be- 
tween |~fcp] cars in the string f3 is greater than or equals 
to fc + 1. Thus, d [ftp] > fc + 1. For p < 1/2, 



d> 



(5) 



[fcpl 



As a result, we conclude that f3 does not contain the 
substring 11 if p < 1/2. By CP symmetry, a does not 
contain the substring 00 whenever p > 1/2. 

The remaining assertion of part (b) follows directly 
from Eq. (|). □ 



B. Separation Automaton 

With the GDCP as stated, we need to distinguish 
the two cases, namely, (1) a = N and /(0) ^ 0, and 
(2) < p(a) and f(p(a)) > 0. In order to do so, we need 
to tell if the bit string a is equal to N or not. It can 
be done using the following automaton Sk together with 
Hfe. In a similar way, we also need to distinguish the case 
of a = l N and a ^ \ N . It can be done using a conjugate 
automaton Sfc. These two automata and are col- 
lectively known as the fcth order separation automaton. 



Rule 2 (fcth order separation rule) We denote this 
CA rule by Sk- The state of site i in the next time step 
Sk[a](i) is given by: 

1 if a(i — fc) = 1 and nk{i — fc) 

SfeN(i) = { =1, (6a) 

a(i) otherwise. 

We denote its conjugate CA rule by Sk- That is, 

if a(i) = and rik(i — fc) = 
S,. [..](/)= { ft - L (6b) 

a(i) otherwise. 



Theorem 2 Let a be a bit string of length N . Then 
p(0) = O ifp(a)=0, 
\<p{P)<l */0<p(a)<i, (7a) 

(3 = 7 and p((3) = p(a) otherwise, 

where (3 = o H|(a) and 7 = H|(a). Similarly, 

p(S) = l ifp(a) = l, 

^ < P{6) <^ if ^ < p(a) < 1, (7b) 
(5 = 7 and p(8) — p(a) otherwise, 

where 8 = sj. o H e k (a). 
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Proof: If p(a) — 0, then a = N (this notation denotes 
N consecutive 0s and we shall use similar notations for a 
bit string) and hence (3 = N and p([3) — 0. 

If p(a) > 1/k, then Theorem [j] tells us that n k (7, i) > 
for all i. Hence, from Eq. (|6a|), (3 = 7 and p(/3) = p(a). 

Finally, if < < 1/fc, then Theorem |l| implies 

that nk(j,i) is equal to either or 1 for all i. More- 
over, at least one bit in the string 7 is 1. Thus, it is 
straight forward to check that applying to 7 makes 

1 < n k (/3, i) < 2 for all i and 1/k < p([3) < 2/k_ 

The proof for the conjugate separation rule S k is sim- 
ilar. □ 



Thus, S 



LJV/fej 



■H^ separates the string from the rest 



oHj(q) has 



in the sense that no string in the form St W ' ^ 
a density of Is in the interval (0, 1/k). This is the reason 
why we call S k the fcth order separation automaton. 



C. Inversion Automaton 

This CA rule and the next one (the exchange automa- 
ton) are two technical CA rules to transform the bit 
string to the form we desired. They are needed to cor- 
rectly solve the GDCP for input strings 0^ and 1^. 

Rule 3 (fcth order inversion rule) We denote this 
CA rule by Ik ■ The state of site i in the next time step 
I k [a](i) is given by: 



!*[«](*) = 



1 ifn k (i) = 0, 
a(i) otherwise. 



We denote its conjugate CA rule by l k . In other words, 



Ik [a_ 



](*) = 



ifn k (i) = k, 
a(i) otherwise. 



(8b) 



A direct consequence of Theorem and Eqs. 
is: 



Theorem 3 Let a be a bit string of length N and (3 = 
I fe oHf(a). Then p(f3) = 1 if p(a) = 0, and p(0) = p{a) 
if p(a) > 1/k. In fact, the bit string H e k (a) is a fixed 
point of l k provided that p(a) > 1/k. Similarly, if 
7 = Ifc oHj(a), then p(fy) = if p{a) — 1, and H^.(a) is 
a fixed point of I& if p(a) < (fc — l)/k. 



The following theorem is a direct consequence of Theo- 
rem and Eq. (§): 

Theorem 4 Let a be a bit string of length N . Then [3 = 
E fc o H{(a) = 1 N if a = N , (3 = N if a = 1^. More- 
over, H e k (a) is a fixed point of~E k "if 1/k < p < (k~l)/k. 



E. Function Automaton 

The two automata introduced in this subsection are 
derived from the continuous function in question. What 
we want it to achieve in the first automaton F/ is that 
after applying it once, there will be a substring of 2q 
or l 2q according to the sgn(/(p)) provided that p is not 
in the form r/q for some r£l This substring will be 
further manipulated by the next automaton. The second 
automaton Ff is used to classify a bit string with p in 
the form r/q for some r G Z. 

Rule 5 (function rule) Let f{x) be the continuous 
function specified in GDCP. We denote the discontinuous 
points in [0, 1] by Pi/q, where pi, q G Z + U {0}. (Here we 
choose q to be the smallest positive integral denominators 
of all the expressions Pi/q.) Recall in Section ^ that we 
require /(gj) and /( ^~ ) to be non-zero. We denote the 
function CA rule associated with f(x) by Ff. The state 
of the site i in the next time step Ff[a](i) is given by: 



F, [«](*) 



where 



ifn 4q (i-j) = 2k + l 
for some < j < 2q — 1, 
k G Z and ^ 0, 



(10a) 



a(i) 



9(x) = 



otherwise, 



1 if x > , 
if x <Q. 



(10b) 



In addition, we denote the associated function CA rule 
of f(x) by Ff. The state of site i in the next time step 
is given by: 



F/ [«K0 = 



*(/(£)) ifn 2q (i) = 
and f(- k 



a(i) 



-2q 

otherwise. 



k,0<k<2q, 
1 ? 0, (10c) 



D. Exchange Automaton 

Rule 4 (fcth order exchange rule) We denote this 
CA rule by . The state of site i in the next time step 
Efc[a](z) is given by: 

( 1 ifn k (i) = 0, 
E k [a](i)=l ifn k (i)=k, (9) 
I a(i) otherwise. 



Theorem 5 We use the notations in the function rule. 
Let a be a bit string of length N. Then [3 = F f o H^a) 
equals N if a = , (3 — 1 N if a — 1 N . More im- 
portantly, if l/2q < p(a) < (2q — l)/2q and p(a) is 
not in the form r/2q for some integer r, then f3 con- 
tains the substring 2q if and only if f(p(a)) < while 
(3 contains the substring l 2q if and only if f(p(a)) > 0. 
Furthermore, (a) is a fixed point ofFf provided that 
either f(p(a)) = or p(a) is in the form r/2q for some 
r = l,2,...,2g-l. 



G 



Let a be a bit string of length N and (3 = F f o 2q 
(a) The strings N and 1 N are fixed points of F t o 



2q o F f o H^^ 9 '' . Moreover, (3 — Q N provided that 



< ,0(0:) < l/2q and /(i) < 0. Similarly, = 1 



N 



provided that (2q - l)/2q < p(a) < 1 and /(^p) > 0. 



Proof: Clearly and 1 are fixed points of both Theorem 7 We use the notations of the function rule. 
H2 g and Ff. Moreover, Theorem [l] demands that 
U2 q (ti-\ q {a) , i) — r for all i provided that p(a) — r/2q 
for some r — 1, 2, . . . , 2q — 1. Thus Eq. (|3|) implies in 
this case that Fl\ q (a) is a fixed point of F/. Similarly, 
Eq. (10a) implies that H^a) is a fixed point of F/ pro- 
vided that f(p(a)) = 0. So it remains for us to show the 
validity of this Theorem when \/2q < p(a) < (2q — l)/2q 
and p(a) is not in the form r/2q for some integer r. 

Suppose p(a) is not in the form r/2q for some inte- 
ger r and l/2q < p{a) < (2q - l)/2q. Theorem [TJ to- 
gether with the fact that periodic boundary conditions 
are imposed on the string a imply that we can always 
find i and j such that n2q(H.2 q (a), i) — n 2q(H-2q( a )^ * + 
2q) + 1 and n 2q (Ki q (a),j) = n 2g (H^( a ),j + 2q) - 1. 
Hence, n4 g (H2 ? (a), i) and r^^H^a), j) are odd num- 
bers. Since all the discontinuous points of sgn(/) in 
the interval [0,1] are in the form Pi/q, so for each 
fixed r = 0, 1, . . . , 2q - 1, sgn(/(x)) = sgn(/(yV) for all 
x,y G {r/2q, (r + l)/2q). Therefore, from Eq. (10a), ei- 
ther n 2q {F f o H^(a),i) or n 2q (F f o H| g (a),i) equals 



2q if f(p(a)) > 0. Besides, either ri2 q (Ff oHya),i) 
or n2q(F f o H2 g (a),j) equals if f(p(a)) < 0. As 
l/2q < p(a) < (2q - l)/2q, Theorem^] tells us that 
1 < [2qp(a)\ < n 2q (Hi(q),i) < [2qp(a)l < 2q - 1 for 
all i. Hence Eq. ( |10a| ) implies that it is not possible for 
the string (3 to contain both 2q and 1 2|J as substrings. 
That is to say, the function rule maps a portion of H^a) 
to l 2q if and only if f(p(a)) > and similarly it maps a 
portion of H^a) to 2q if and only if f(p(a)) < 0. □ 

Theorem 6 We use the notations of the function rule. 
Let a be a bit string of length N . Then (3 = F f o H^a) 
equals N if p(a) = r/2q for some r = 1, 2, . . . , 2q— 1 and 
f{r/2q) < 0. Similarly, (3 — 1 N if p(a) = r/2q for some 
r = 1, 2, . . . , 2q - 1 and f(r/2q) > 0. Moreover, H e 2q (a) 

is a fixed point of Ff if (a) a 
(c) < p(a) < 1 and f{p(a)) = 0. 



N , or (b) a = 1 



N 



Proof: Direct application of Theorem |l| and Eq. (10c). □ 



F. Propagation Automaton 

After applying the function automaton F^-, there will 
be a substring of 2q or l 2q provided that p is not in the 
form r/q for some r £ Z and f(p)^0. To convert the 
whole bit string to the desired form, we have to propagate 
the substring to everywhere. 



(b) Suppose l/2q < p{a) < (2q - l)/2q. Then [3 = N 
if and only if f(p(a)) < 0. Moreover, (3 = 1 . if and 
only if f(p(a)) > 0. Finally, f3 — Hj 9 (a) and hence 
p((3) = p(a) if and only if f(p(a)) = 0. 

Proof: To show that 0^ and 1^ are fixed points of 
Ff o P^~ 2q o Ff o H| 9 is straight forward. Suppose 
< p(a) < l/2q and f(l/2q) < 0. Then, Theorem 
implies that n2q(H 2q (a), i) < 1 with equality holds for 
some i. Since f(l/2q) < 0, so by continuity of / and the 
distribution of discontinuous points of sgn(/), we know 
that f(x) < whenever < x < 1/q. Hence FjoH^(a) 
contains the substring 2q and does not contain the sub- 
string l 2q . Thus by Eq. (|ll| ) and Theorem [| we con- 
clude that (3 = N . The proof of (3 = 1 N provided that 
(2q - l)/2q < p{a) < 1 and /(^-) > is similar. So, 
we have proved the validity of part (a). 

We know from Theorem [l| that H|„ (a) is a fixed point 
of P f provided that l/2q < p(a) < {2q - l)/2q. There- 
fore, part (b) of this Theorem follows directly from The- 
orems |[ H and Eq. (|l]) . □ 



IV. GENERALIZED DENSITY CLASSIFICATION 
AUTOMATA 



After introducing all the necessary building blocks in 
the last section, we are ready to report our solution of 
the GDCP. 

Theorem 8 Let a be a bit string of length N. Let f(x) 
be a continuous function. Denote the rational discontin- 
uous points of sgn(f(x)) by Pi/q with Pi,q G Z + U {0}. 
(Here q is chosen to be the smallest positive integral de- 
nominator for the expressions Pi/q.) Suppose further that 
f(^-) and /( %~ ) are both non-zero, then the following 
sequence of CA rules solves the GDCP for the function 
/(*)•• 



Rule 6 (propagation rule) Let a be a bit string. We 
denote the propagation rule by P f . The effect of ' P f 
each bit is given by 



c f = F f o pj- 2q o F/ o (E 2g r 5 o (i 2g ) Q4 o (i 2q r o 



on 



H 



2<; 



•2c, 



J ° {^q 0b 2 9 



1 ifn 2q {i + 1) = 2q 
*](%)={ if n 2q (i + 1) = 0, 
a(i) otherwise. 



H 



l(2q) 
2q J 



(12) 



(11) 



wht 



1 = 1 if /(^) > 0, al = otherwise; a2 = 1 



if /(^) < 0, a2 = otherwise; a3 = 1 if /(0) > 
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and /(l) > 0, a3 = otherwise; a4 = 1 if /(l) < 
and /(0) < 0, a4 = otherwise; a5 = 1 if /(O) > and 
/(l) < and a5 = otherwise. 

Before we prove this theorem, let us first informally 
explain how it works. The H^ 2 ^ in the right-most part 
of Eq. ( |i~2| ) makes the density of car uniform. Then the 

(il 2q 2qS> o sJj^' 29 "' *j term first separates p = from < 

p < l/2q and then makes the resultant bit string uniform 
in car density whenever /(^) and hence also f(x) > 

for all < x < l/2q. Similarly, the oSlf 2,J j° 2 

term first separates p — 1 from (2q — l)/2q < p < 1 
and then makes the resultant bit string uniform in car 
density whenever /( |~ ) and hence also /(x) < for 
all (2g — l)/2q < x < 1. Next, notice that a3 and 
a5 will not simultaneously equal to one, and similarly 
a4 and a5 will not simultaneously equal to one, the 

(E 2q ) a5 o (l-2q) a ° (l2g) a3 term correctly deals with the 
GDCP for p = and 1. This is followed by the term 
pj~ 2q oFf which correctly classifies those p not in the 

form r/2q. Finally, the term Ff settles the remaining 
case of p in the form r/2q with r / or 2q. 

Proof: We divide the proof into the following four cases: 
Case (a): p(a) = or 1, that is, a — Q N or 1 N : in this 
case, Cf — Ff o pj- 2q oFjo (E 2 ,) a5 o (T 2g ) a4 o (I 2q ) 



a 3 



So by Theorems |and |, 0/(0^) = FjoP^oF/fi") 



where x = sgn(/(0)) and similarly C/(l 



>N-2q 



F f o 



i f o Ff(y ) where y = sgn(/(l)). Hence, from 
Eqs. flLOaj ), @) and @, the theorem holds for p(a) = 
or 1. 

Case (b): l/2q < p(a) < (2q-l)/2q: in this case, The- 
orems || and I tell us that C f = P/oPj-^oPyoH^. 
Hence, this case is settled by applying Theorem | 

Case (c): < p(a) < l/2q: in this case, the continuity 
of / together with our assumptions that /(l/2g) ^ and 
discontinuous points of sgn(/(x)) are in the form rjq for 
some r € Z demand that / can be further divided into 
the following two subcases, namely, (1) f{x) > for all 
< x < 1/q, and (2) f(x) < for all < x < 1/q. 

In subcase (1), al = 1. Theorem | tells us that 
l/2q < p(S^ /2ql o H l 2q (a)) < 1/q. From Theorem § we 



know that C/(a) = F f o F^ 2q oF/o o S 



{ e(2q) n a lN/2q] 



H e 2q 2q) {a). Combining with Theorem 0, C/(a) = 1 JV = 
(sgn(f(p))) N . 

In subcase (2), the continuity of / implies that al = 
a3 = a5 = and hence C/(a) = F/ o P^~ 29 o F/ o 

H^'^a). So, applying Theorem ^ gives C/(a) = 0^ = 
(sgn(/(p))) w . 

Therefore, Eq. ( |l2| ) correctly classifies bit string with 
< p < l/2q. 



J 2g 



Case (d): (2q - l)/2g < p(a) < 1: The proof of this 
case is similar to that of case (c) and we are not going to 
write the details here. 

In summary, C/(a) = N if f(p) < 0, C/(a) = 1 N 

if f{p) > 0, and C f (a) = H e 2q 2q \a) if f{p) = 0. Conse- 
quently, the result of the GDCP can be read out from any 
2q consecutive sites of the final output bit string. More 
precisely, if we find that such substring equals 2q , then 
either f(p) < or p = and /(0) = 0. If such substring 
is l 2q , then either f(p) > or p = 1 and /(l) = 0. Lastly, 
if such substring contains both and 1, then p ^ 0, p ^ 1 
and f{p) = 0. □ 



V. SIMPLE SOLUTIONS TO CERTAIN SPECIAL 
CASES 

Theorem | provides a solution to the GDCP involving 



all the nine automata introduced in Section III . Nonethe- 



less, it does not mean that solution of any GDCP have 
to be that complicated. In this section, we report simple 
solutions to certain useful GDCPs. Comparing to the 
GDCP, solutions to the following problems are rather 
straight-forward. So the presentation in this section is 
brief. 



A. Cases Related To Rational Density Classification 

The simplest non-trivial case of GDCP is the rational 
density problem which chooses f(x) = x — p c for a fixed 
rational number < p c < 1. Chau et al. solved this 
problem by the following two automata || . 

Rule 7 (modified traffic rule) Let a be a bit string of 
length N and p c — p/q where p,q £ Z + with p and q are 
relatively prime. Then the modified traffic rule is give by 



T p [a](i) 



1 if a(i) = 0, a(i — 1) = 1, and 

n q -i(i) <p-l, 
ifa(i) = l,a(i + 1) = 0, and (13) 
n q -i(i + 1) < p- 1, 



a(i) otherwise. 



Rule 8 (Modified Majority Rule) Let a be a bit 

string of length N and p c = p/q where p,q € Z + with 
p and q are relatively prime. Then the modified majority 
rule is give by 



M p [a](i) = 



1 ifn 2q+1 (i 
otherwise. 



q) >2p+l, 



(14) 



More precisely, they showed that M^/ 2 ^- 1 )! 

T rW(ma X (g,2p)-l) roax (g-p,p)/pgl+g-2 solyes the rationa l 

density classification problem with f{x) = x — p c . 
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By applying I q , I q or E g once to the resultant state 
of the rational density classification automata, it is clear 
that we can solve a number of related problems using 
three CAs, including f{x) = p c — x, f(x) = (x — p c ) 2 and 

f(x) = -{X- p c ) 2 . 



B. Cases Related To Coarse-Grained Rational 
Density Classification 

Another interesting and useful problem we have briefly 
mentioned in Section [| is the so-called approximate ra- 
tional density classification. Let us recall that if p c = p/q 
is a rational number, then solution to the rational den- 
sity classification problem either by the method reported 
in Section IV or in Ref. M requires a rule table whose 



range scales linearly with q. Thus, it is more cost effective 
to classify a coarse-grained rational density. To achieve 
this task, we are required to solve the GDCP with the 
function / given by Eq. (||) with p c \ < p c < p C 2- An 
effective way to choose p c \ and p c i is to use the contin- 
ued fraction approximation of p c . If the output string is 
1^, we know that p is greater than p C 2 and hence also 
p c . Similarly, we know that p < p c if the output is 0^. 
If pd < p < p C 2, then we do not know if the p is greater 
than p c or not. Fortunately, p is preserved by the au- 
tomata in this case and hence we may feed our output 
bit string to say the full rational density classification 
CAs for fine-grained density determination. In this way, 
we can efficiently solve the rational density classification 
problem with small rule table size with high probability 
for a randomly given input bit string. Performing a CA 
which is a function of output of another CA leads us to 
the notion of CA programming. We shall explore the 
power and weakness of CA programming elsewhere |l2j . 

After pointing out the significance of the coarse- 
grained rational density classification problem, we re- 
port a solution involving only two CA rules. We write 
Pel = Pi/q and p C 2 = P2/9 as usual. The first CA rule 
is our car hopping automaton H 9 . (Clearly we cannot 
use T as we have two critical densities here.) The second 
CA rule is a variation of the propagation rule, as stated 
below. 

Rule 9 (modified propagation rule) Let a be a bit 

string. Then the modified propagation rule is given by 

1 if n q (i + j) > P2 for some 
1 < j < q, 

*](*) = { ifn q [i + 1) < pi for some 
1 < j < Q, 
a(i) otherwise. 

(15) 

Clearly, if p > p C 2, there is a site i such that 
n q (H. q (a),i) > p2- Thus, p increases to I under the re- 
peated application of P Pl , P2 ,q- Similarly, p decreases to 



Pi,P2,qi 



under the repeated application of ~Pp t , V2> q if p < p c i- In 
summary, we know that Ppf^lg H^ 9 -* solves the coarse- 
grained rational density classification problem. 

In a similar way, problems with f{x) — (x — p c \)(x — 
Pc2), f{x) = (x — p c i)(x — p C 2) 2 and so on can be solved 
using four CAs. 



C. Variation Of The Theme 

Because of the difficulties in distinguishing between 
the strings 0^ and O^ -1 !, we introduce the separa- 
tion automaton Sfc. Unfortunately, is not car den- 
sity conserving. This is precisely the reason why we 
impose the technical conditions that f{X/2q) 7^ and 
f((2q — l)/2q) ^ 0. We remark that the above tech- 
nical conditions can be waived provided that we relax 
the GDCP a bit. Instead of requiring that the den- 
sity of Is of the output string (3 equals to that of the 
input string a if /0(a)) — 0, we replace it by requir- 
ing that \f{p{0)) - /0(a)) I < l/q- By doing so, it is 
clear that the GDCP can be solved even when f(l/2q) 
or f((2q — l)/2q) = by the following sequence of au- 
tomata: 



F / op^oF / o(E 2o r 5 o(i 29 ) Q4 o(i 29 r 3 

M 2 ? ° h 2 9 ° ^2q ° b 2q ° H 2q ) 



(16) 



where a3 = 1 if /(0) > and /(l) > 0, a3 = other- 
wise; a4 = 1 if /(l) < and /(0) < 0, a4 = otherwise; 
a5 = 1 if /(0) > and /(l) < and a5 = otherwise. 

However, we do not encourage the use of this relaxed 
definition of GDCP for the density of Is is not preserved 
in case f(p) = 0. 



VI. DISCUSSIONS 

A few remarks are in order. First, Theorem || pro- 
vides a CA solution to the GDCP with at most eight CA 
rules each with rule table of range < 4g. (Note that the 
number eight comes from the observation that we can 

combine F/ o (E2 9 )° 5 o (T-2 q ) a ° (l2g) a3 together to form 
one single CA rule whose rule table size is still indepen- 
dent of the string length N.) Moreover, the total run 
time required scales as 0(qN) whenever q < N, making 
it asymptotically optimal up to a constant factor. The 
main ingredient used to solve the GDCP is the fcth order 
traffic rule whose repeated application leads to a uni- 
formly distributed Is in the bit string in the sense that 
fluctuation of the fcth order local number rik(i) does not 
exceed 1. 

We stress that the solution to the GDCP using a se- 
quence of CAs is not unique. We have also found that 
a slightly different traffic rule involving 2k + 1 sites to- 
gether with a generalized majority vote rule based on the 
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work in Ref. || can also do the job. Nevertheless, this 
alternative method requires in general more than eight 



CA rules in succession |13 



supported in part by the Outstanding Young Researcher 
Award of the University of Hong Kong. 



We are not sure if the GDCP can be solved using fewer 
than eight CA rules. What we know from the result of 
Fuks is that density classification, being a special case 
of GDCP, cannot be solved using a single CA rule ||. 
Although certain special cases of the GDCP, such as the 
original density classification problem can be solved us- 
ing two CA rules H|), we feel that it is highly unlikely 
to solve the full GDCP using just two CA rules because 
of the difficulties involved in separating the cases with 
p close to and p equals although we have provided 
solutions of the rational density classification and coarse- 
grained rational density classification problems using two 
CA rules in Section 

Finally, we remark that the CA solution to the GDCP 
shows a rich flow diagram. The flow of p under the action 
of C f exhibits at most two stable fixed points at p = 
and 1 corresponding to f(p) ^ 0, finitely many fixed 
points corresponding to the rational isolated roots of the 
equation f(x) = 0, together with possibly infinitely many 
neutral fixed points corresponding to the remaining non- 
isolated zeros of f(x) =0 in the limit of N — + oo. In this 
respect, sequence of CAs may lead to very interesting 
flow diagrams that are distinctive from those resulting 
from conventional continuous dynamical systems. 
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